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GAP comparison of sequences in WO 2004/003013 application. 



Polynucleotides - Percent Sequence Identity 



SEQ ID 


CQ760237 


CQ760239 


CQ760241 


1 


72.694 


52.193 


51.695 


3 


72.767 


52.852 


52.119 


5 


72.767 


52.281 


51.342 


7 


72.840 


52.535 


51.412 


9 


73.152 


50.797 


51.133 



Polypeptides - Percent Sequence Identity 



SEQ ID 


CQ760237aa 


CQ760239aa 


CQ760241aa 


2 


84.615 


45.946 


44.965 


4 


85.055 


45.946 


44.965 


6 


85.275 


45.270 


44.836 


8 


85.275 


45.495 


44.836 


10 


85.388 


45.902 


45.122 
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GAP of: CQ760237 check: 1467 from: 1 to: 1377 

WPDEF 1121 RuvB NCBI CQ760237 from WO2004/00313 
1121 RuvB NCBI CQ760237 from WO2004/00313 

LOCUS CQ760237 1377 bp DNA linear PAT 17-APR-2005 

DEFINITION Sequence 4 from Patent WO2004003013 . 
ACCESS ION CQ7 60237 

VERSION CQ760237.1 GI:44903794 . . . 

to: 1121SID3 check: 4044 from: 1 to: 1912 
WPDEF Case 1121 SEQ ID NO: 3 RuvB 

Symbol comparison table: nwsgapdna . cmp CompCheck: 8760 



Gap Weight : 
Length Weight : 



50 
3 



Average Match: 10.000 
Average Mismatch: 0.000 



Quality: 10020 
Ratio: 7.277 
Percent Similarity: 72.767 



Length: 1912 
Gaps : 0 
Percent Identity: 72.767 



Match display thresholds for the alignment (s) : 



| = IDENTITY 
: = 5 
. = 1 



1121CQ760237 x 1121SID3 



October 4, 2005 17:43 



1 atggagaaagtaaaga 16 

mi i ii ii 

51 ctccacagaaacagagagcgcataaccggcggcgttggcggcgatgagga 100 



17 ttgaagaaattcagtccaccgctaagaaacaacggattgctactcacacc 66 




tcgaggaggtgcagtcgacctcgaagaagcagcgcatcgccacccacacc 150 



67 catatcaaaggccttggcctcgagccaactggtatccctataaaattggc 116 



151 cacatcaagggactcggcctcgacgccaatgggatggcgattgcgttggc 200 




117 agctggatttgttggtcaacttgaggctagagaggcagctggtcttgtag 166 



201 ggcggggttcgtgggccaggcggcggcgcgcgaggcggccgggctggcgg 250 




167 
251 



ttgacatgattaagcagaagaaaatggcgggcaaggctcttttgcttgct 

I MINIMI llllllll IIIII III II I I IIIII 

tcgacatgattcgccagaagaagatggccggccgcgcggtgctccttgcg 



216 
300 
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217 ggacctcctggaactgggaaaacagctttggctcttggaatctctcaaga 266 

II II II I II II II II II I II II II II I II II 

301 ggtccgcccgccacgggcaagacggcgctagcgctcggcatagcccagga 350 

267 gctgggaagcaaggttccattctgtccaatggttggatctgaggtttact 316 

Ml II MINIM II MINIM 1 1 1 1 1 INN II II INI 

351 gctcggcagcaaggtccctttctgtcctatggtaggatcagaagtgtact 400 

317 catcagaggttaagaaaacagaggttctcatggagaattttagacgtgcc 366 

I II I II 1 1 llllllll INN II II 1 1 1 INN I I II 

401 cctcggaggtcaagaaaactgaggtgctgatggaaaatttccgtagagct 450 

367 attggtctacgtatcaaggaaaccaaagaagtctatgaaggggaggtcac 416 

II IN I INN lllllll MINI II llllllll INN II 

4 51 ataggtttgcgtataaaggaaaacaaagaggtttatgaaggagaggttac 500 

417 cgagctgtcaccagaagaaactgaaagcctcactggaggttatggtaaaa 466 

II II II llllllll INI II INN II INI INI 

501 tgaactttccccagaagaggctgagagtacaactggtggatatgcaaaaa 550 

467 gcatcagccatgttgtaattacactcaagacagtcaaaggaaccaaacat 516 

INI llllllll I II I I INN II INN II II II 

551 gcattagccatgtaatcatcagcttaaagactgttaaagggactaagcaa 600 

517 ctgaaattggatcccactatctatgatgccttgattaaggaaaaggtagc 566 

INN II III I I II llllllll INI NINIINN II 

6 01 ctgaagttagattcttcaatttatgatgctctgatcaaggaaaaggtggc 650 

567 tgtaggagatgtaatctatatcgaagcaaacagtggagctgtcaaacggg 616 

II II 1 1 1 1 1 II II II INI II IN llllllll II III I I 

651 agtgggtgatgttatatacatcgaagcaaatagtggagcagtgaaaagag 700 

617 taggtagaagtgatgcttttgccactgaatttgatctggaagcagaagaa 666 

I WWW INN lllllll II INI INN INN INN 

701 ttggtagatgtgattcttttgctacagaatacgatcttgaagctgaagag 750 

667 tatgttccacttcccaaaggagaggtccacaaaaagaaagagatagtgca 716 

llllllll I llllllll II INN II II INN II INN 

751 tatgttcctatccccaaaggtgaagtccataagaaaaaagaaattgtgca 800 

717 ggatgtcacactccaagatctggatgcagcaaatgctcgacctcaaggtg 766 

1 1 1 1 1 M 1 1 1 1 1 II II II NNIMIININN II lllllll 

801 ggatgtcacacttcatgaccttgatgcagcaaatgctcagccacaaggtg 850 

767 gccaggatatactttctttgatgggccaaatgatgaaaccgcggaagact 816 

INI INN I II I llllllll ' 1 1 1 1 M 1 1 I II MINI 

851 gccaagatattttgtcccttatgggccagatgatgaaaccacgaaagact 900 

817 gagatcactgataagcttcggcaagaaattaacaaggttgtgaaccgata 866 

II INN II II II II NINIINN INN II II INI 

901 gaaatcaccgaaaaactacgccaagaaattaataaggtggtaaatagata 950 
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867 tatagatgaaggtgtggcagagcttgttccaggagttctatttattgatg 916 

III II 1 1 1 I E I Mi II II III I II MMI 

951 tatcgatgaaggaattgcagagcttgtacctggtgttttgttcattgatg 1000 
917 aggttcatatgcttgatatggagtgcttctcatacttgaaccgtgctctt 966 

1 1 1 1 II III I Mill II II II II II I MMMM I 

1001 aggtccacatgttggatatcgaatgtttttcttatcttaaccgtgcattg 1050 
967 gagagctcattatctccgatagtgatatttgcaacaaatagaggtgtttg 1016 

MUM lllllll II II 1 1 1 1 1 1 MM MMMM II I II 

1051 gagagcccattatcaccaatcgtgatacttgctacaaataggggaatatg 1100 
1017 caacgtaagagggactgatatgcccagcccccatggagtccctattgatc 1066 

II MMMM MINIMI I II II Mill I II I MM 

1101 taatgtaagaggaactgatatgacaagtccacatggtataccggtggatc 1150 
1067 tattagatcgattggttatcatccggactcaaatctatgatccctctgaa 1116 

I Mill I MM II II Mill I I MM II MM 

1151 ttctagataggctggtgattattcggacagagacatatggccctactgag 1200 

• • • • • 
1117 atgatccagattatagccattcgtgcgcaagttgaagaattaaccgtgga 1166 

Mill Mill I II II II II Mill II II I MM 

12 01 atgatacagatattggctatccgagcacaagtggaggagattgatatgga 1250 
1167 tgaagaatgcttggttctacttggggagattgggcaaagaacttcactaa 1216 

lllllll I I I I I II Mill II II II II II 

12 51 tgaagaaagtcttgcttatttaggcgagatcggacagcagacatctttga 1300 
1217 ggcacgctgtgcagcttctgtctcctgccagcattgtagcgaaaatgaat 1266 

I II Ml III I I II MMMM III I II I Ml 

1301 gacatgctattcaattgatatcacctgccagcgtggtctcaaagactaat 1350 

• • • • • 
1267 ggccgtgacaatatttgcaaggctgatatagaggaagtaacatcactcta 1316 

II I II II II MINIMUM I MINIM I Mill 

13 51 ggaagagagaaaatctgcaaggctgatctcgaggaagtcagtgggctcta 1400 
1317 cttggatgctaaatcttcagcaaagcttttgcatgagcaacaagaaaaat 1366 

MMMM Mill II II III I II IIMMMIMM II 

14 01 tttggatgccaaatcctcggctcggctgctccaggagcaacaagaaagat 1450 
1367 acatctcatga 1377 

Mill II 

1451 acatcacctagatttggatctcctgtcgtggaagtctcgaagagaatgta 1500 
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GAP of: CQ760237aa check: 5361 from: 1 to: 458 

WPDEF 1121 RuvB NCBI encoded by CQ760237 from WO2004/00313 
1121 RuvB NCBI encoded by CQ760237 from WO2004/00313 
SEQ ID NO: 5 NCBI CAF32893 . 

LOCUS CQ760237 1377 bp DNA linear PAT 17-APR-2005 

DEFINITION Sequence 4 from Patent WO2004003013 . 
ACCESSION CQ760237 . . . 

to: 1121SID4 check: 8436 from: 1 to: 455 

WPDEF Case 1121 SEQ ID NO: 4 RuvB 

Symbol comparison table: blosum62.cmp CompCheck: 1102 
BLOSUM62 amino acid substitution matrix. 

Reference: Henikoff, S. and Henikoff, J. G. (1992). Amino acid 

substitution matrices from protein blocks. Proc. Natl. 

Acad. 

Sci. USA 89: 10915-10919. 

Gap Weight: 8 Average Match: 2.778 

Length Weight: 2 Average Mismatch: -2.248 

Quality: 1994 Length: 458 

Ratio: 4.382 Gaps: 0 

Percent Similarity: 90.110 Percent Identity: 85.055 

Match display thresholds for the alignment (s) : 
| = IDENTITY 
: = 2 
. = 1 

1121CQ760237aa x 1121SID4 October 4, 2005 18:01 .. 



1 MEKVKIEEIQSTAKKQRIATHTHIKGLGLEPTGIPIKLAAGFVGQLEARE 50 

I I I :| I U I I I I II I I I II I I I h |. I MINIM III 
1 . . . MRIEEVQSTSKKQRIATHTHIKGLGLDANGMAIALAAGFVGQAAARE 47 

51 AAGLWDM I KQKKMAGKALLLAGP PGTGKT ALALG I S QE LGS KVP FC PMV 100 

MM I II hi I II I MM I MM 1 1 1 1 1 1 1 1 1 1 - 1 E 1 1 1 1 1 1 1 1 1 1 1 

48 AAGLAVDM I RQKKMAGRAVLLAGPP ATGKTALALG I AQELGS KVP FC PMV 97 
101 GSEVYSSEV1CKTEVLMENFRRAIGLRIKETKEVYEGEVTELSPEETESLT 150 

1 1 f 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 I E 1 1 1 i 1 1 1 1 1 1 1 1 1 M I 

98 GSEVYSSEVKKTEVLMENFRRAIGLRIKENKEVYEGEVTELS PEEAESTT 147 
151 GGYGKSISHWITLKTVKGTKHLKLDPTI YD ALIKE KVAVGDVIYIEANS 200 

Ml lllllhl-llllllll MM M 1 1 1 1 II M I IM 1 1 1 1 1 1 1 1 1 

148 GGYAKS I SHVI I SLKTVKGTKQLKLDSS I YD ALIKE KVAVGDVIYIEANS 197 
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201 GAVKRVGRSDAFATEFDLEAEEYVPLPKGEVHKKKEIVQDVTLQDLDAAN 250 

MINI UliMIHI IHMHIIIIIIIIII 1 1 1 1 1 1 

198 GAVKRVGRCDS FATEYDLEAEEYVP I PKGEVHKKKE I VQDVTLHDLDAAN 247 

251 ARPQGGQD I LS LMGQMMKPRKTE I TDKLRQE INKWNRYIDEGVAELVPG 300 

I Ml II I I III II I II I Mil II I MMM I II III I I Ml I MM I II I 
248 AQPQGGQD I LS LMGQMMKPRKTE I TEKLRQEINKWNRY I DEG I AELVPG 297 

• • • • • 
301 VLFIDEVHMLDMECFS YLNRALES SLSP IVI FATNRGVCNVRGTDMPS PH 350 

IIIIIIIIIII-IIIIIIIIIIII MINI 1 1 II I M 1 1 1 1 II I Ml 

2 98 VLF I DEVHMLD I ECFS YLNRALES PLS P IVI LATNRG ICNVRGTDMTS PH 347 
351 GVPIDLLDRLVIIRTQIYDPSEMIQIIAIRAQVEELTVDEECLVLLGEIG 400 

MIMIIIIIIIMI: I IMIIIIMIIIMIM -Ml I Mill 

348 GIPVDLLDRLVIIRTETYGPTEMIQILAIRAQVEEIDMDEESLAYLGEIG 3 97 

• • • • • 
401 QRTS LRHAVQLL S PAS I VAKMNGRDNI CKAD I EEVTS LYLDAKS S AKLLH 450 

MllllhlhlllhM llh MllhllM I II I I I I I I : I I 
398 QQTS LRHAI QL I S P AS WS KTNGRE K I CKADLEE VS GLYLDAKS S ARLLQ 447 

451 EQQEKYIS 458 

lllhll- 
448 EQQERYIT 455 



